18.785: Analytic Number Theory, MIT, spring 2007 (K.S. Kedlaya) 
Primes in eirithmetic progressions 



In this unit, we first prove Dirichlet's theorem on primes in arithmetic progressions. We 
then prove the prime number theorem in arithmetic progressions, modulo some exercises. 



1 Dirichlet's theorem 

For short, 1 will say an arithmetic progression is eligible if it has the form m,m -\- N,m -\- 
2N, . . . where gcd(m, A^) = 1; it is equivalent to ask that any two consecutive terms are 
relatively prime. 

Theorem 1 (Dirichlet). Any eligible arithmetic progression of positive integers contains 
infinitely many primes. 

There are a few special cases where one can prove this directly, but otherwise algebraic 
methods cannot touch this problem. Dirichlet's idea was to prove, in some appropriate quan- 
titative sense, that the primes distribute themselves equally among the eligible arithmetic 
progressions with a particular difference; this goes back to Euler's proof of the infinitude of 
primes using the Riemann zeta function. 



2 Asymptotic density and Dirichlet density 

In order to speak quantitatively about the distribution of certain types of primes, or integers 
in general, we need some sort of measure theory on the set of primes or the set of integers. 
Note that Lebesgue-type measure theory is not an option for countable sets: we can only 
hope to make finitely additive measures. 

For S C. T two sets of positive integers, with T infinite, the upper natural density and 
lower natural density oi S in T are defined as 

#{neS:n<N} . , #{n e S : n < N} 

lim sup —7 — — V , hm mi —7 — — r- . 

JV-.00 ^{n eT ■.n<N}' n^oo #{n E T : n < N} 

Of course the upper density is never less than the lower density. If they coincide, we call the 
common value the natural density (or asymptotic density) of S in T. 

Many interesting sets fail to have a natural density (e.g., see exercises). We get a less 
restrictive notion of density by using Dirichlet series. 

For SOT two sets of positive integers, with "^neT divergent, the upper Dirichlet 
density and lower Dirichlet density of 5" in T are defined as 

hm sup , lim inf ' . 

If they coincide, we call the common value the Dirichlet density of S in T. 
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Let us make this explicit in two cases of interest. Recall that ({s) has a simple pole of 
residue 1 at s = 1, so as s — > 1"^, 



oo 



(.-1) J]n- = l + o(l). 



n=l 



Hence if T = N, then the Dirichlet density of S is given by 



n ^ 



lim (s 

if the limit exists. 

Taking logarithms, we see that as s — > 1^, 

oo 

log Cis) + \og(s - 1) = log(s + E^^"" = ^(1)- 

p n=l 

Moreover, EpEr=2P""' = ^(l), so 

J]p-^ = -log(5-l) + 0(l). 
p 

Hence if T is the set of primes, then the Dirichlet density of -S" is given by 



lim 



— log(s — 1) 

if the limit exists. 

Here are some easy facts about density. (If I don't specify natural vs. Dirichlet, I mean 
that the statement holds if you make a choice and use it consistently throughout the state- 
ment.) 

• Any finite set has density in any infinite set. 

• Density is a finitely additive measure: if Si, . . . ,Sm are disjoint subsets of T with 
densities . . . , 5^ in T, then their union has density Si + ■ • • + 5^ in T . Corollary: 
two subsets of T whose combined density exceeds 1 must have infinite intersection. 

• If yS" has density 5 in N, then for any positive integer n, nS — {ns : s & S} has density 
5/n. 

I can't help mentioning a fun example of the additivity of densities. Let a, (5 be positive 
irrational numbers with l/a-\-l/ (5 — 1. Put 

Sa = {[na\ : n e N} 

Then Sa^Sp have natural densities 1/ a, 1/(3. The fact that these add up to 1 is explained 
by the beautiful result (Beatty's theorem) that Sa, Sp are disjoint and their union is N! (If 
you've never seen this before, I recommend this as an amusing exercise.) 
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Lemma 2. Let S CT be subsets ofN such that S has natural density 5 in T. Then S also 
has Dirichlet density S in T. 

Proof. See exercises. (The converse is false; also see exercises.) □ 

To prove Theorem 1, we will prove the following. 

Theorem 3. For any positive integers m, N with gcd(m, N) = 1, the set of primes congruent 
to m modulo N has Dirichlet density 1/4>{N) in the set of all primes (hence is infinite). 



3 L-functions and discrete Fourier analysis 

For X a Dirichlet character of level N, we can write 

oo 

logL(s,x) = J]5^X(P>-"^ 

p n=l 

for Re(s) > 1; as s — > 1"^, we have 

p 

For X nonprincipal, we know that L{s,x) is holomorphic and nonvanishing at s = 1, so 

p 

whereas for xo the principal conductor of level N, we saw above that 

Exo(p)p"' = -Iog(s-1) + O(l). 
p 

At this point it may be clear how to proceed: form a certain linear combination of the 
logL(s, x) to isolate J2p=m{N)P~^ y compare the asymptotic contributions of — log(s — 1). 

The fact that we can do this amounts to what is sometimes called discrete Fourier 
analysis; if you prefer, it is the representation theory of the finite abelian group (Z/A^Z)*. 

Theorem 4 (Discrete abelian Fourier analysis). Let G be a finite abelian group, and let G 
be the character group (or dual group^ ofG, i.e., the group of homomorphisms G — C*. 

(a) The order of G is equal to the order of G. 

(b) (Orthogonality of characters) If Xi-iX2 £ G, then 

( \~~r\ J 1*^1 x\ = X2 

^ 1° XI 7^X2. 
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(c) If 91,92 e G, then 

J2^^9i)x{92) 



91 ^92 
91 + 92- 



Proof, (a) li G = Gi x G2, then clearly G = Gi x G2. Since every finite abelian group G 
is a product of cyclic groups, we may reduce to the case where G is cyclic, and then 
the result is clear. (For G = (Z/iVZ)*, we can make this more explicit: we can use 
the Chinese remainder theorem to split N into distinct prime-power factors, then use 
the fact that (Z/p^Z)* is cyclic unless p = 2, in which case it is {±1} times a cychc 
group.) 

(b) We saw this argument once before, but here it goes again: the left side is invariant 
under multiplication by Xi{h)X2{h) for any h & G, because there is no difference 
between summing over g or over gh. If Xi 7^ X2, then we can make that multiplier 
different from 1 by choosing suitable h. So the sum vanishes if xi X2- If Xi = X2, 
each summand is equal to 1 because characters of finite groups takes values which are 
roots of unity. 

(c) See exercises. 

□ 

So now it is clear what to do: given a choice of m coprime to iV, taking sums of x over 
all Dirichlet characters of level A?", we obtain 

X] ^ ^ ^TTv) S S X{p)x{m)p-' + 0(1) 

= p-' + 0{l) 

p=m (N) 

as s — > 1+. On the other hand, 

^ExMlogL(«,x) = T7^1og^(«,Xo) + -7^ E xRlogL(.,x) 

^ log(s-l) + 0(l). 



m) 

This yields Theorem 3. 



4 The prime number theorem in arithmetic progres- 
sions 

The proof of Dirichlet's theorem only uses information about the behavior of the L{s,x) 
near s = 1. Using the fact that L{s,x) 7^ on the entire hue Re(s) = 1, we can prove a 
much stronger result. 
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Theorem 5 (Prime number theorem in arithmetic progressions). Form, N positive integers 
with gcd{m, N), the set of primes congruent to m modulo N has natural density 1/0(A^). 
In other words, the number of primes p < x with p = m (mod N) is asymptotic to 
as X — > oo. 

Proof. Given what we now know, this is a straightforward adaptation of our proof of the 
prime number theorem. For x ^ Dirichlet character of level N, define 

^x(^) = '^x{p)^ogp. 

p<x 

Given a choice of m coprime to A^, put 

p<x:p=m (AT) 

As in the proof of the prime number theorem, if we prove that the improper integral 

4>{N)d^{x)-x 



i: 



dx 



converges, we may then deduce that '&m{x) ~ '^(m^ ^ desired. 
It suffices in turn to check that for x principal. 



i: 



M^) - X 



converges, and for x nonprincipal. 



i: 



x"^ 



x'^ 



dx 



converges. The former is an immediate consequence of the corresponding fact for (which 
we proved in the unit on the prime number theorem), since d and i^^ differ in only finitely 
many terms. For the latter, sec exercises. □ 

As with the proof of the prime number theorem, we get very little information about 
the error term, i.e., the difference between the actual number of primes p < x with p = m 
(mod A^) and the asympotic count -^^i^- That becomes a problem if, for instance, we 
want to know how long it takes to find one prime in an arithmetic progression. To address 
this, we must first get better results on zero-free regions for the L(s, x), then make a better 
analytic argument to take advantage of the improved analytic information. We turn to this 
in the next few lectures. 
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Exercises 



1. Prove Lemma 2. (Hint: use partial summation.) 

2. Let S be the set of positive integers which have first digit 1 when written in base 10. 

(a) Compute the upper and lower natural density of S, and verify that S does not 
have a natural density. 

(b) Prove that S has a natural Dirichlet density, and compute it. 

Optional (not to be turned in): generalize to an arbitrary base b > 3. Even more 
optional: prove the analogous result for the set of primes with first digit 1 in base b. 

3. Prove that there exists a constant c such that 

- = loglogx + c + o(l). 

p<x 

(Hint: you established asymptotics for X^p<j. on a previous homework. Apply 
partial summation.) 

4. (a result of Mertens; tricky, optional) In the previous exercise, prove that 

c = 7 + $^(log(l-p-')+p-'), 
p 

where 7 is Euler's constant. Then deduce that 

log x 

p<x 

5. Deduce point (c) of Theorem 4 from points (a) and (b). (Hint: form the matrix A with 
rows indexed hj g & G, columns indexed by % G G, and entries xio)- Then compare 
AA* with A* A, where * denotes conjugate transpose. Or if you prefer, prove that the 
dual of G is canonically isomorphic to C) 

6. Prove that ^ dt converges for x nonprincipal, by applying the Taubcrian theorem 
from the unit on the prime number theorem. (Hint: use the fact that L{s,x) 7^ 
for Re(s) > 1 to argue that —L'{s,x)/L{s,x) is holomorphic in a neighborhood of 
Re(s) > 1. There will be an extra term to deal with, just as there was a term I 
neglected in the original notes from the prime number theorem unit; see the corrected 
notes online.) 
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